Display apparatus with intelligent user interface

ABSTRACT

A display apparatus includes presence detection circuitry for detecting an individual in proximity to the display apparatus; a display for displaying video content and a user interface; a processor in communication with the user input circuitry, the display, and the search history database; and non-transitory computer readable media in communication with the processor that stores instruction code. The instruction code is executed by the processor and causes the processor to: a) determine, from the presence detection circuitry, a user in proximity of the display apparatus; b) determine one or more program types associated with the user; c) determine available programs that match the predicted one or more programs; and d) update the user interface to depict a listing of one or more of the available programs that match the predicted one or more programs.

CROSS REFERENCE

This application is a continuation of and claims priority to the U.S.patent application Ser. No. 15/985,325, filed on May 21, 2018, theentirety of which is herein incorporated by reference.

BACKGROUND Field

This application generally relates to a display apparatus such as atelevision. In particular, this application describes a displayapparatus with an intelligent user interface

Description of Related Art

The current breed of higher end televisions typically include networkconnectivity to facilitate streaming video content from content serverssuch as Netflix®, Hulu®, etc. In some cases, the televisions utilizeoperating systems such as Android® that facilitate execution of apps forother purposes.

Access to the ever-increasing number of new features requires changes tothe user interface. Unfortunately, access to these newer features oftentimes results in user interfaces that are frustratingly complex anddifficult to navigate.

SUMMARY

In first aspect, a display apparatus includes a user input circuitry forreceiving user commands and a display for outputting video content and auser interface. The video content includes metadata. The apparatus alsoincludes a processor in communication with the user input circuitry andthe display, and non-transitory computer readable media in communicationwith the processor that stores instruction code. The instruction code isexecuted by the processor and causes the processor to receive, from theuser input circuitry, a first scene command to search for scenes in thevideo content of a scene type. The processor determines, from themetadata, one or more scenes in the video content related to the scenetype. The processor then updates the user interface to depict one ormore scene images related to the one or more scenes related to the scenetype.

In a second aspect, a method for controlling a display apparatusincludes receiving, via user input circuitry, user commands, outputting,via a display, video content and a user interface. The video contentincludes metadata. The method includes receiving, from the user inputcircuitry, a first scene command to search for scenes in the videocontent of a scene type; determining, from the metadata, one or morescenes in the video content related to the scene type; and updating theuser interface to depict one or more scene images related to the one ormore scenes related to the scene type.

In a third aspect, a non-transitory computer readable media that storesinstruction code for controlling a display apparatus is provided. Theinstruction code is executable by a machine for causing the machine toreceive, from user input circuitry, a first scene command to search forscenes in the video content of a scene type; determine, from metadata ofvideo content, one or more scenes in the video content related to thescene type; and update a user interface to depict one or more sceneimages related to the one or more scenes related to the scene type.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an exemplary environment in which a display apparatusoperates;

FIG. 2 illustrates exemplary operations for enhancing navigation ofvideo content.

FIGS. 3A-3C illustrate exemplary user interfaces that may be presentedto a user during the operations of FIG. 2;

FIG. 4 illustrates exemplary operations that facilitate locating aparticular type of video content;

FIG. 5 illustrates an exemplary user interface that may be presented toa user during the operations of FIG. 4;

FIG. 6 illustrates exemplary operations for determining informationrelated to images in video content.

FIGS. 7A and 7B illustrate exemplary user interfaces that may bepresented to a user during the operations of FIG. 6;

FIG. 8 illustrates alternative exemplary operations for determininginformation related to images in video content;

FIGS. 9A and 9B illustrate exemplary user interfaces that may bepresented to a user during the operations of FIG. 8;

FIG. 10 illustrates alternative exemplary operations for automaticallypausing video content;

FIGS. 11A and 11B illustrate exemplary user interfaces that may bepresented to a user during the operations of FIG. 10;

FIG. 12 illustrates alternative exemplary operations for automaticallypausing video content;

FIGS. 13A-13D illustrate exemplary user interfaces that may be presentedto a user during the operations of FIG. 12;

FIG. 14 illustrates exemplary operations for adjusting various smartappliances based on a detected routine of a user;

FIGS. 15A-15B illustrate exemplary user interfaces that may be presentedto a user during the operations of FIG. 14; and

FIG. 16 illustrates an exemplary computer system that may form part ofor implement the systems described in the figures or in the followingparagraphs.

DETAILED DESCRIPTION

The embodiments described below are directed to various user interfaceimplementations that facilitate access to television features in anintelligent, easy to use manner. Generally, the user interfaces rely onvarious machine learning techniques that facilitate access to thesefeatures and other information with a minimum number of steps. The userinterfaces are configured to be intuitive, with minimal learning timerequired to become proficient in navigating the user interfaces.

FIG. 1 illustrates an exemplary environment in which a display apparatusoperates. Illustrated are, the display apparatus 100, a group of mobiledevices 105, a GPS network 110, a computer network 115, a group ofsocial media servers 120, a group of content servers 125, a supportserver 127, and one or more users that may view and/or interact with thedisplay apparatus 100. The display apparatus 100, social media servers120, content servers 125, and support server 127 may communicate withone another via a network 107 such as the Internet, a cable network, asatellite network, etc.

The social media servers 120 correspond generally to computer systemshosting publicly available information that may be related to the users130 of the display apparatus 100. For example, the social media servers120 may be Facebook®, Twitter®, LinkedIn®, etc. The social media servers120 may include blogs, forums, and/or any other systems or websites fromwhich information related to the users 130 may be obtained.

The mobile devices 105 may correspond to mobile phones, tablets, etc.carried by one or more of the users 130. The mobile devices 105 mayinclude short range communication circuitry that facilitates directcommunication with the display apparatus 100. For example, the mobiledevices 105 may include Bluetooth® circuitry, nearfield comminutioncircuitry, etc. The communication circuitry facilities detection of of agiven mobile device 105 when it is in the proximity of display apparatus100. This in turn may facilitate determination, by the display apparatus100, of the presence of a user 130 within viewing distance of thedisplay apparatus 100.

The GPS network 110 and computer network 115 may communicate informationto the display apparatus 100 that may in turn facilitate determination,by the display apparatus 100, of the general location of displayapparatus 100. For example, the GPS network 110 may communicationinformation that facilitates determining a relatively precise locationof the display apparatus 100. The computer network 115 may assign an IPaddress to the display apparatus 100 that may be associated with ageneral location, such as a city or other geographic region.

The content servers 125 correspond generally to computer systems hostingvideo content. For example, the content servers 125 may correspond tohead-end equipment operated by a cable television provider, networkprovider, etc. The content servers 125 may in some cases store videocontent such as movies, television shows, sports programs, etc.

In some cases, video content may include metadata that defines variousaspects of the video content. For example, metadata associated with asports matchup may include information timestamps, still images, etc.related to various events of the match, such as goals, penalties, etc.The metadata may include information associated with differentindividuals depicted in the video content such as the names of players,coaches, etc.

The metadata in the video content may include information thatfacilitates determining whether the video content is of a particulartype (e.g., comedy, drama, sports, adventure, etc.). The metadata mayinclude information associated with different individuals depicted inthe video content such as the names of actors shown in the videocontent. The metadata may include information associated with differentobjects depicted in the video content such as garments worn byindividuals, personal items carried by the individuals, and variousobjects that may be depicted in the video content.

The metadata may have been automatically generated beforehand by variousmachine learning techniques for identifying individuals, scenes, events,etc. in the video content. In addition or alternatively, the machinelearning techniques may use some form of human assistance in making thisdetermination.

The support server 127 corresponds generally to computer systemconfigured to provide advanced services to the display apparatus 100.For example, support server 127 may correspond to high-end computer thatconfigured to perform various machine learning technique for determiningthe meaning of voice commands, predicting responses to the voicecommands, etc. The support server 127 may receive voice commands andother types of commands from the display apparatus 100 and communicateresponses associated with the commands back to the display apparatus

The display apparatus 100 may correspond to a television or otherviewing device with enhanced user interface capabilities. The displayapparatus 100 may include a CPU 150, a video processor 160, an I/Ointerface 155, an AI processor 165, a display 175, a support database153, and instruction memory 170.

The CPU 150 may correspond to processor such as an Intel®, AMD®, ARM®,etc. based processor. The CPU 150 may execute an operating system, suchas Android®, Linux®, or other operating system suitable for executionwithin a display apparatus. Instructions code associated with theoperating system and for controlling various aspects of the displayapparatus 100 may be stored within the instruction memory 170. Forexample, instruction code stored in the instruction memory 170 mayfacilitate controlling the CPU 150 to communicate information to andfrom the I/O interface 155. The CPU 150 may process video contentreceived from the I/O interface 155 and communicate the processed videocontent to the display 175. The CPU 150 may generate various userinterfaces that facilitate controlling different aspects of the displayapparatus.

The I/O interface 155 is configured to interface with various types ofhardware and to communicate information received from the hardware tothe CPU. For example, the I/O interface 155 may be coupled to one ormore antenna's that facilitate receiving information from the mobileterminals 105, GPS network 110, WIFI network 115, smart appliances 117,etc. The I/O interface may be coupled to a imager 151 arranged on theface of the display apparatus 100 to facilitate capturing images ofindividuals near the display apparatus. The I/O interface may be coupledto one or more microphones 152 arranged on the display apparatus 100 tofacilitate capturing voice instructions that may be conveyed by theusers 130.

The AI processor 165 may be correspond to a processor specificallyconfigured to perform AI operations such as natural language processing,still and motion image processing, voice processing, etc. For example,the AI processor 165 may be configured perform voice recognition torecognize voice commands received through the microphone. The AIprocessor 165 may include face recognition functionality to identifyindividuals in images captured by the camera. In some implementations,the AI processor 165 may be configured to analyze content communicatedfrom one or more content servers to identify objects within the content.

Exemplary operations performed by the CPU 150 and/or other modules ofthe display apparatus 100 in providing an intelligent user interface areillustrated below. In this regard, the operations may be implemented viainstruction code stored in non-transitory computer readable media 170that resides within the subsystems configured to cause the respectivesubsystems to perform the operations illustrated in the figures anddiscussed herein.

FIG. 2 illustrates exemplary operations for enhancing navigation ofvideo content. The operations of FIG. 2 are better understood withreference to FIGS. 3A-3C.

At block 200, the display apparatus 100 may be depicting video content,such as a soccer match, as illustrated in FIG. 3A. The user 130 may thenissue a first scene command 305 to the display apparatus 100 to have thedisplay apparatus 100 search for scenes in the video content. Forexample, the user 130 may simply speak out loud, “show me all thegoals.” In this case, the natural language processor implemented by theCPU 150 alone or in cooperation with the AI processor 165 may determinethe meaning of the voice command. In addition or alternatively, dataassociated with the voice command may be communicated to the supportserver 127 which may then ascertain the meaning of the voice command andconvey the determined meaning back to the display apparatus.

As illustrated in FIG. 3A, in some implementations, the user interface300 may include a phrase control 310 that is updated in real-time todepict text associated with the commands issued by the user.

At block 205, in response to the first scene command 305, the displayapparatus 100 may determine scenes in the video content that are relatedto a type of scene associated with the first scene command 305. In thisregard, the CPU 150 alone or in cooperation with the AI processor 165may implement various machine learning techniques that utilize metadataassociated with the video content to determine scenes in the videocontent that are related to the scene type. In addition oralternatively, the first scene command 305 may be communicated to thesupport server 127 and the support server 127 may determine and conveythe scene type to the display apparatus 100.

At block 210, the user interface 300 of the display apparatus 100 may beupdated to depict scene images 320 associated with the determinedscenes. For example, images 320 from the video content metadataassociated with the scenes may be displayed on the user interface 300.The images 320 may correspond to still images and/or a sequence ofimages or video associated with the scene.

In some implementations, the user interface 300 may be updated todisplay unique identifiers 325 on or near each image.

At block 215, the user 130 may specify a second scene command thatspecifies one of the unique identifiers 325. For example, the user 130may specify “three” to select the scene associated with the third image320.

At block 220, video content associated with the specified uniqueidentifier 325 (e.g., “three”) may be displayed on the user interface300, as illustrated in FIG. 3C.

Returning to block 200, in some implementations, the user 130 may refinea scene command by specify additional information. For example, inresponse to receiving the first scene command 305 at block 200, at block225 one or more potential scene commands 315 related to the first scenecommand 305 may be determined. The machine learning techniquesimplemented by the CPU 150, AI processor 165, and/or the support server127 may be utilized to determine the potential scene commands related tothe first scene command 305. In this regard, the metadata in the videocontent may define a hierarchy of scene commands utilized by the machinelearning techniques in determining potential scene commands related to agiven first scene command 305.

At block 230, the user interface 300 may be updated to depict one ormore of the potential scene commands 315, as illustrated in FIG. 3A. Forexample, in response to the first scene command 305 “show me all thegoals,” the potential scene commands “in the first half”, “by realMadrid”, etc. may be determined and depicted.

At block 235, the user 130 may issue one of the potential scene commands315 to instruct the display apparatus 100 to search for scenes in thevideo content, as illustrated in FIG. 3B. For example, the user 130 maysimply speak out loud, “in the first half.” The phrase control 310 maybe updated in real-time to depict text associated with the first scenecommand 305 and the second scene command 320.

The operations may repeat from block 205. For example, in response tothe second scene command 320, the display apparatus 100 may determinescenes in the video content that are related to a type of sceneassociated with the first scene command 305 and the second scene command320. In addition or alternatively, the first scene commands 305 and thesecond scene command 320 maybe conveyed to the support server 127 andthe support server 127 may convey information that defines relatedscenes to the display apparatus.

It should be understood that additional scene commands beyond the firstand second scene commands may be specified to facilitate narrowing downdesired content. For example, after issuance of the second scene command320, another group of potential scene commands 315 may be depicted, andso on.

FIG. 4 illustrates exemplary operations that facilitate locating aparticular type of video content. The operations of FIG. 4 are betterunderstood with reference to FIG. 5.

At block 400, the display apparatus 100 may be depicting video content,such as a sitcom, as illustrated in FIG. 5. The user 130 may issue afirst search command 505 to the display apparatus 100 to have thedisplay apparatus 100 search for a particular type of video content. Forexample, the user 130 may simply speak out loud, “show.” In this case,the natural language processor implemented by the CPU 150 alone or incooperation with the AI processor 165 may determine the meaning of thevoice command. In addition or alternatively, data associated with thevoice command may be communicated to the support server 127 which maythen ascertain the meaning of the voice command and convey thedetermined meaning back to the display apparatus.

At block 405, the display apparatus 100 may determine video content thatis related to the first search command 505. In this regard, the CPU 150alone or in cooperation with the AI processor 165 may implement variousmachine learning techniques that utilize metadata associated with thevideo content to determine video content that is related to the searchcommand. In addition or alternatively, the first search command 305 maybe communicated to the support server 127 and the support server 127 maydetermine and convey information related to the video content that is inturn related to the first search command to the display apparatus 100.

At block 410, the user interface 500 may be updated to depict controls520 that facilitate selecting video content. Each control may include aunique identifier 525 on or near the control 520 that facilitatesselecting the control by voice. For example, a first control with theunique identifier “one” may correspond to an image that represents aninput source of the display apparatus 100 that facilitates selectingvideo content from the input source. A second control with the uniqueidentifier “two” may correspond to an image of an actor that, whenselected, facilitates selecting video content that includes the actor. Afourth control with the unique identifier “four” may correspond to ascene from a movie that the user frequently watches or that isassociated with types of shows the user 130 watches.

The machine learning techniques may determine the type of control todisplay based at least in part on a history of search commands andselections specified by the user that may be stored in the supportdatabase 153 of the display apparatus 100 or maintained within thesupport server 127. In some implementations, the support database 153 isdynamically updated to reflect the user's choices to improve therelevancy of the controls displayed to the user for subsequent request.

At block 415, the user 130 may specify a second search command thatspecifies one of the unique identifiers. For example, the user 130 mayspecify “four” to select the scene associated with the fourth image 520.

At block 420, video content associated with the specified uniqueidentifier (e.g., “four”) may be depicted on the user interface 500 ofthe display apparatus 100.

Returning to block 400, in some implementations, the user 130 may refinea search command by specifying additional information. For example, inresponse to receiving the first search command at block 400, at block425, one or more potential second search commands 515 related to thefirst search command 505 may be determined. The machine learningtechniques implemented by the CPU 150, AI processor 165, and/or thesupport server 127 may be utilized to determine the potential commandsrelated to the first search command 505. As noted earlier, the metadatain the video content may include information that facilitatesdetermining whether the video content is associated with a particulartype of video content (e.g., comedy, drama, sports, etc.). This metadatamay be utilized by the machine learning techniques in determiningpotential second search commands related to a given first searchcommand.

At block 430, the user interface 500 may be updated to depict one ormore of the potential search commands 515, as illustrated in FIG. 5. Forexample, in response to the first scene command “show,” the potentialsearch commands 515 “games”, “action movies”, etc. may be determined anddisplayed.

As described earlier, in some implementations, the user interface 500may include a phrase control 510 that is updated in real-time to depicttext associated with the commands issued by the user.

At block 435, the user 130 may issue one of the potential searchcommands 515 to instruct the display apparatus 100 to search for varioustypes of video content. For example, the user 130 may simply speak outloud, “action movies.” The phrase control 510 may be updated inreal-time to depict text associated with the first search command 505and the second search command 515 (e.g., “show action movies”).

The operations may repeat from block 405. For example, in response tothe second search command, the display apparatus 100 may determine videocontent that is related to the first and second search commands anddisplay appropriate controls for selection by the user.

FIG. 6 illustrates exemplary operations for determining informationrelated to images in video content. The operations of FIG. 6 are betterunderstood with reference to FIGS. 7A and 7B.

At block 600, the display apparatus 100 may be depicting video content,such as a movie, as illustrated in FIG. 7A. The user 130 may issue afirst query 505 to the display apparatus 100 to have the displayapparatus 100 provide information related to the query. For example, theuser 130 may simply speak out loud, “who is on screen.” In this case,the natural language processor implemented by the CPU 150 and/or AIprocessor 165 may determine the meaning of the voice command. Inaddition or alternatively, data associated with the voice command may becommunicated to the support server 127 which may then ascertain themeaning of the voice command and convey the determined meaning back tothe display apparatus 100.

At block 605, in response to the first query 505, the display apparatus100 may determine one or more objects of the image associated with thequery 505. In this regard, the CPU 150 alone or in cooperation with theAI processor 165 may implement various machine learning techniques thatutilize metadata associated with the video content to determinedifferent objects being depicted on the user interface 700 of thedisplay apparatus 100. In addition or alternatively, the first query 505may be communicated to the support server 127 and the support server 127may determine and convey information related to different objectsdepicted on the user interface 700 to the display apparatus 100.

At block 610, the user interface 700 of the display apparatus 100 may beupdated to depict controls 720 that facilitate selecting differentobjects. Each control may include a unique identifier 725 on or neareach control 720 that facilitates selecting the control by voice. Forexample, controls for each actor may be depicted in the user interface700.

At block 615, the user 130 may select one of the unique identifiers 725.For example, the user 130 may specify “two” to select a particularactor.

At block 620, the user interface 700 may be updated to depictinformation related to the selection. For example, as illustrated inFIG. 7B, an informational control 730 with information related to theselected actor may be provided.

Returning to block 605, in some implementations, the user 130 may refinea the query by specifying additional information. For example, inresponse to receiving the first query at block 600, at block 625, one ormore potential second queries 715 related to the first query 705 may bedetermined. The machine learning techniques implemented by the CPU 150and/or the support server 127 may be utilized to determine the potentialqueries related to the first query 705. Metadata in the video contentmay be utilized by the machine learning techniques in determiningpotential queries related to a given first search query.

At block 630, the user interface 500 may be updated to depict one ormore of the potential queries 715, as illustrated in FIG. 7A. Forexample, in response to the first scene command “who is on screen,” thepotential queries “other movies by john doe”, “where was it filmed”,etc. may be determined and depicted.

As described earlier, in some implementations, the user interface 700may include a phrase control 710 that is updated in real-time to depicttext associated with the commands issued by the user.

At block 635, the user 130 may indicate a second query that correspondsto one of the potential queries 615 to instruct the display apparatus100 to depict information related to the query. The phrase control 610may be updated in real-time to depict text associated with the firstquery 605 and the second query.

At block 640, objects related to the second query may be determined andincluded with or may replace the objects previously determined. Then theoperations may repeat from block 610.

FIG. 8 illustrates alternative exemplary operations for determininginformation related to images in video content. The operations of FIG. 6are better understood with reference to FIGS. 9A and 9B.

At block 800, the display apparatus 100 may be depicting video content,such as a sitcom, as illustrated in FIG. 9A. The user 130 may issue acommand to the display apparatus 100 to pause the video content so thata still image is depicted on the user interface 900.

At block 805, the display apparatus 100 may determine one or moreobjects of the image. In this regard, the CPU 150 alone or incooperation with the AI processor 165 may implement various machinelearning techniques that utilize metadata associated with the videocontent to determine different objects being depicted in the stillimage. In addition or alternatively, the still image may be communicatedto the support server 127 and the support server 127 may determine andconvey different objects being depicted in the still image to thedisplay apparatus 100.

At block 810, the user interface of the display apparatus 100 may beupdated to depict controls 920 that facilitate selecting differentobjects, as illustrated in FIG. 9A. For example, controls 920 may beprovided for selecting an advertisement related to one of the objects inthe still image, to share the video content, to rate the video content,to display information related to one of the objects. Controls 920 forother aspects may be provided.

Each control 920 may include a unique identifier on or near the control920 that facilitates selecting the control by voice.

At block 815, the user 130 may select one of the unique identifiers. Forexample, the user 130 may specify the unique identifier associated witha control depicting a handbag that corresponds to a handbag shown in thestill image.

At block 820, the user interface 900 may be updated to depictinformation related to the selection. For example, as illustrated inFIG. 9B, an informational control 925 with information related to theselection may be provided. In one implementation, the informationalcontrol 925 may depict a QR code associated with a URL that may beutilized to find out more information related to the selection. The QRcode facilitates navigation to the URL by scanning the QR code with anappropriate application on, for example, a mobile device.

FIG. 10 illustrates alternative exemplary operations for automaticallypausing video content. The operations of FIG. 10 are better understoodwith reference to FIGS. 11A and 11B.

At block 1000, the display apparatus 100 may determine whether a user isin proximity of the display apparatus 100. For example, in oneimplementation, the imager 151 of the display apparatus 100 may captureimages in front of the display apparatus. The CPU 150 alone or incooperation with the AI processor 165 may control the imager 151 tocapture an image, analyze the captured image to identify face data inthe image, and compare the face data with face data associated with theuser 130 to determine whether the user 130 is in proximity of thedisplay apparatus. In this regard, face data associated with the user130 may have been previously captured by the display apparatus 100during, for example, an initial setup routine. The face data may havebeen stored to the support database 153.

In another implementation, near field communication circuitry of thedisplay apparatus 100 may be utilized to detect the presence of a devicein proximity to the display apparatus, carried by a user 130, that hasnear field communication capabilities. The device may have been previousregistered with the display apparatus 100 as belonging to a particularuser. Registration information may be stored to the support database153.

At block 1005, if the user is determined to not be in proximity of thedisplay apparatus 100, then at block 1010, if the video content is notalready paused, the video content may be paused, as illustrated in FIG.11A. Referring to FIG. 11A, a status control 1105 may be depicted on theuser interface 1100 to indicate that the video content has been paused.

In some implementations, the user interface 1100 may depict additionaldetails related to a still image depicted on the user interface 1100such as the information described above in relation to FIGS. 9A and 9B.

If at block 1005, the user 130 is determined to be in proximity of thedisplay, then at block 1015, if the video content is not alreadyresumed, the video content may be resumed, as illustrated in FIG. 11B.Referring to FIG. 11A, the status control 1105 may be updated toindicate that the video content will be resuming.

In some implementations, the display apparatus 100 may perform theoperations above even when other users 130 are in proximity of thedisplay apparatus 100. For example, in an initial state, a number ofusers 130 that includes a primary user 130 may be in proximity of thedisplay apparatus. When the primary user is subsequently determined tonot be in proximity of the display apparatus, the video content may bepaused, as described above. When the primary user is subsequentlydetermined to be in proximity of the display apparatus, the videocontent may be resumed.

FIG. 12 illustrates alternative exemplary operations for automaticallypausing video content. The operations of FIG. 12 are better understoodwith reference to FIG. 13A-13D.

At block 1200, the display apparatus 100 may determine whether a user isin proximity of the display apparatus. For example, in oneimplementation, the imager 151 of the display apparatus 100 may captureimages in front of the display apparatus. The CPU 150 alone or incooperation with the AI processor 165 may control the imager 151 tocapture an image, analyze the captured image to identify face data inthe image, and compare the face data with face data associated the userto determine whether the user is in proximity of the display apparatus100. As noted above, face data associated with the user 130 may havebeen previously captured by the display apparatus 100 during, forexample, an initial setup routine.

In another implementation, the presence of the user 130 may bedetermined based on near field communication circuitry of a devicecarried the user 130, as described above.

At block 1205, if a user is determined to be in proximity of the displayapparatus 100, then one or more program types associated with the user130 are determined. In this regard, the CPU 150 alone or in cooperationwith the AI processor 165 may implement various machine learningtechniques to determine program types associated with the user 130. Inaddition or alternatively, information that identifies the user 130 maybe communicated to the support server 127 and the support server 127 maydetermine program types associated with the user. The machine learningtechniques may determine the program types associate with the user 130by, for example, analyzing a history of programs viewed by the user 130,by receiving information from social media servers 120 related to likesand dislikes of the user, and/or by another manner.

At block 1210, programs that are available for watching at the time ofuser detection or within a predetermined time later (e.g., 30 minutes)may be determined. For example, metadata associated with available videocontent may be analyzed to determine whether any of the video content isrelated to the user associated program types determined above.

At block 1215, the user interface 1300 may be updated to presentinformation 1305 related to available programs that match the userassociated program types. The user interfaces 1300 may include controlsthat facilitate watching one of the available programs, recording theavailable programs, etc.

In some implementations, a group of users 130 may be detected withinproximity of the display apparatus 100 and the program types determineat block 1205 may be based on the intersection of program typesassociated with two or more of the users 130. The user interface 1300may be updated to depict information 1305 related to available programsthat match the intersection of user associated program types.

In certain implementations, the operations above may be performedspontaneously when a user 130 is detected. For example, a first user 130may be viewing video content on the display apparatus 100 when a seconduser comes within proximity of the display apparatus. The operationsperformed above may occur after detection of the second user.

In other implementations, the operations above may be performedimmediately after powering on the display apparatus 100.

In yet other implementation, the operations may be performed after apower off indication has been received. For example, as illustrated inFIG. 13B, the display apparatus 100 may either power up after havingbeen off or may cancel a power off operation, and the user interface1300 may be updated to depict a minimal amount of information so as notto cause too much of a distraction. For example, the user interface 1300may merely depict an informational control 1305 to make the user 130aware of, for example, an upcoming program. A control 1310 may beprovided to allow the user 130 to bring the display apparatus 100 intofully powered up condition to facilitate watching the program.

In yet other implementations, one or more information types associatedwith the user 130 may be determined and the user interface 1300 may beupdated to depict information associated with the determined informationtypes. For example, as illustrated in FIG. 13C, the user 130 may havebeen determined to be interested in knowing the weather. In this case,the display apparatus 100 may be powered up in a minimal power state andan informational control 1305 that displays information related to theweather may be depicted. Or the informational control 1305 may beupdated to display information related to an upcoming televisionepisode, as illustrated in FIG. 13D. After a pre-determined time (e.g.,1 minute) the display apparatus 100 may power down.

FIG. 14 illustrates exemplary operations for adjusting various smartappliances based on a detected routine of a user 130. The operations ofFIG. 14 are better understood with reference to FIG. 15A-15B.

At block 1400, the display apparatus 100 may receive data that relatesthe state of various smart appliances 117 and display apparatus 100usage. For example, data that relates light switches, timers, draperycontrollers, and other smart appliances 117 that were previously relatedto display apparatus 100 usage may be received. In this regard,communication circuitry of the display apparatus 100 may continuouslyreceive state information from smart appliances 117. The supportdatabase 153 may store the state information of the smart appliances 117along with usage information of the display apparatus 100. The CPU 150may correlate the state information of the smart appliances 117 and theusage information of the display apparatus 100 to form a relationbetween the state of the smart appliances and the display apparatususage. The relation may be indicative of a routine that the user 130follows in watching video content on the display apparatus 100.

The state information may define an activation state of the smartappliance 117. For example, whether a smart light was on, off, or dimmedto a particular setting such as 50%. Other information may includewhether smart drapes were closed, partially closed, etc. The usageinformation may define times of usage of the display apparatus, programtypes viewed on the display apparatus, lists of specific users of thedisplay apparatus, and specific characteristics of the display apparatus100 such as volume, contrast, and brightness of the display apparatus,etc.

At block 1405, the display apparatus usage may be determined, and atblock 1410, corresponding states for one or more smart appliances 117may be determined based on the received data. For example, the displayapparatus usage may indicate that the display apparatus 100 is set to amovie channel, that the picture controls have been set to a cinema modeand that the display apparatus 100 is being used in the evening on aFriday night. The smart appliance state/display apparatus usagecorrelation data may indicate that under these conditions, the lights ofthe room where the display apparatus 100 is located are typically offand that the blinds are closed.

At block 1415, the state of the various smart appliances may be setaccording to the state determined at block 1410. For example, the CPU150 may, via the communication circuitry of the display apparatus 100,adjust the various smart appliances 117.

As illustrated in FIG. 15A, the user interface 1500 may include aninformational control 1505 to notify the user 130 that a routine wasdetected. For example, the user interface 1500 may note that that thedisplay apparatus 100 is in a “picture mode” and that a smart bulb iscontrolled when the display apparatus 100 is in this mode. Asillustrated in FIG. 15B, the user interface 1500 may be updated toprovided details related to the detected routine such as a name assignedfor the routine (e.g., “Movie Time 8 PM”), a time when the mode “picturemode” was entered (e.g., 8:01 PM) and a setting to set the smartappliance to. (E.g., 10%).

FIG. 16 illustrates a computer system 1600 that may form part of orimplement the systems, environments, devices, etc., described above. Thecomputer system 1600 may include a set of instructions 1645 that theprocessor 1605 may execute to cause the computer system 1600 to performany of the operations described above. The computer system 1600 mayoperate as a stand-alone device or may be connected, e.g., using anetwork, to other computer systems or peripheral devices.

In a networked deployment, the computer system 1600 may operate in thecapacity of a server or as a client computer in a server-client networkenvironment, or as a peer computer system in a peer-to-peer (ordistributed) environment. The computer system 1600 may also beimplemented as or incorporated into various devices, such as a personalcomputer or a mobile device, capable of executing instructions 1645(sequential or otherwise) causing a device to perform one or moreactions. Further, each of the systems described may include a collectionof subsystems that individually or jointly execute a set, or multiplesets, of instructions to perform one or more computer operations.

The computer system 1600 may include one or more memory devices 1610communicatively coupled to a bus 1620 for communicating information. Inaddition, code operable to cause the computer system to performoperations described above may be stored in the memory 1610. The memory1610 may be a random-access memory, read-only memory, programmablememory, hard disk drive or any other type of memory or storage device.

The computer system 1600 may include a display 1630, such as a liquidcrystal display (LCD), a cathode ray tube (CRT), or any other displaysuitable for conveying information. The display 1630 may act as aninterface for the user to see processing results produced by processor1605.

Additionally, the computer system 1600 may include an input device 1625,such as a keyboard or mouse or touchscreen, configured to allow a userto interact with components of system 1600.

The computer system 1600 may also include a disk or optical drive unit1615. The drive unit 1615 may include a computer-readable medium 1640 inwhich the instructions 1645 may be stored. The instructions 1645 mayreside completely, or at least partially, within the memory 1610 and/orwithin the processor 1605 during execution by the computer system 1600.The memory 1610 and the processor 1605 also may includecomputer-readable media as discussed above.

The computer system 1600 may include a communication interface 1635 tosupport communications via a network 1650. The network 1650 may includewired networks, wireless networks, or combinations thereof. Thecommunication interface 1635 may enable communications via any number ofcommunication standards, such as 802.11, 802.12, 802.20, WiMAX, cellulartelephone standards, or other communication standards.

Accordingly, methods and systems described herein may be realized inhardware, software, or a combination of hardware and software. Themethods and systems may be realized in a centralized fashion in at leastone computer system or in a distributed fashion where different elementsare spread across interconnected computer systems. Any kind of computersystem or other apparatus adapted for carrying out the methods describedherein may be employed.

The methods and systems described herein may also be embedded in acomputer program product, which includes all the features enabling theimplementation of the operations described herein and which, when loadedin a computer system, is able to carry out these operations. Computerprogram as used herein refers to an expression, in a machine-executablelanguage, code or notation, of a set of machine-executable instructionsintended to cause a device to perform a particular function, eitherdirectly or after one or more of a) conversion of a first language,code, or notation to another language, code, or notation; and b)reproduction of a first language, code, or notation.

While methods and systems have been described with reference to certainembodiments, it will be understood by those skilled in the art thatvarious changes may be made and equivalents may be substituted withoutdeparting from the scope of the claims. Therefore, it is intended thatthe present methods and systems not be limited to the particularembodiment disclosed, but that the disclosed methods and systems includeall embodiments falling within the scope of the appended claims.

We claim:
 1. A device comprising: a display interface; a user inputinterface; a memory for storing instructions; and a processor incommunication with the memory, the user input interface, and the displayinterface, wherein the processor is configured to execute theinstructions to: display a video in the display interface; receive auser request as a voice signal from the user input interface forinformation about one or more objects displayed in a scene of the video;recognize the user request from the voice signal; determine the one ormore objects in the scene based on information related to the one ormore objects depicted on the display interface sent from a supportserver; and in response to the user request: display an image associatedwith the one or more objects and control icons of the one or moreobjects in a first subsection of the display interface overlaying thevideo; receive a user selection from the user input interface of one ofthe one or more objects; and display additional information related tothe selected object.
 2. The device of claim 1, wherein the processor isconfigured to execute the instructions to display an identifier on ornear the control icons of the one or more objects.
 3. The device ofclaim 1, wherein the one or more objects comprise facial information ofone or more characters in the scene.
 4. The device of claim 1, whereinthe one or more objects comprise item information in the scene.
 5. Thedevice of claim 1, wherein the one or more objects comprise informationof one or more places in the scene.
 6. The device of claim 1, whereinthe processor is configured to execute the instructions to display acontrol for selecting an advertisement related to one of the one or moreobjects in the scene in response to a second user request input from theuser input interface.
 7. The device of claim 6, wherein the processor isconfigured to execute the instructions to display a unique identifier onor near the control for selecting the advertisement facilitated by voicecontrol.
 8. The device of claim 7, wherein the processor is configuredto execute the instructions to display an informational control withinformation related to the selection of the advertisement.
 9. The deviceof claim 7, wherein the processor is configured to execute theinstructions to display a QR code associated with a URL pointing to anetwork location for finding more information related to the selectionof the advertisement.
 10. A method for interactive control of a displaydevice, comprising: displaying a video in a display interface; receivinga user request as a voice signal from a user input interface forinformation about one or more objects displayed in a scene of the video;recognizing the user request from the voice signal; determine the one ormore objects in the scene based on information related to the one ormore objects depicted on the display interface sent from a supportserver; and in response to the user request: displaying an imageassociated with the one or more objects and control icons of the one ormore objects in a first subsection of the display interface overlayingthe video; receiving a user selection from the user input interface ofone of the one or more objects; and displaying additional informationrelated to the selected object.
 11. The method of claim 10, wherein themethod further comprising displaying an identifier on or near controlicons of the one or more objects.
 12. The method of claim 10, whereinthe one or more objects comprise facial information of one or morecharacters in the scene.
 13. The method of claim 10, wherein the one ormore objects comprise item information in the scene.
 14. The method ofclaim 10, wherein the one or more objects comprise information of one ormore places in the scene.
 15. The method of claim 10, wherein the methodfurther comprising: displaying a control for selecting an advertisementrelated to one of the one or more objects in the scene in response to asecond user request input from the user input interface.
 16. The methodof claim 15, wherein the method further comprising: displaying a uniqueidentifier on or near the control for selecting the advertisementfacilitated by voice control.
 17. The method of claim 16, wherein themethod further comprising: display an informational control withinformation related to the selection of the advertisement.
 18. Themethod of claim 16, further comprising: displaying a QR code associatedwith a URL pointing to a network location for finding more informationrelated to the selection of the advertisement.